# Wav2Vec2 fine-tuning
Wav2vec2 Ser English Finetuned
This model is fine-tuned based on the Wav2Vec2 architecture, specifically designed to recognize six emotional states (sadness, anger, disgust, fear, happiness, neutral) in English speech, with an accuracy of 92.42%.
Audio Classification English
W
dihuzz
68
1
My Awesome Mind Model
Apache-2.0
An audio classification model fine-tuned on the minds14 dataset based on the facebook/wav2vec2-base model
Audio Classification
Transformers

M
Gyaneshere
4
0
Baby Cry Classification Finetuned Babycry V4
Apache-2.0
A baby cry classification model fine-tuned based on wav2vec2-large-xlsr-53-english, achieving 81.5% accuracy
Audio Classification
Transformers

B
Wiam
120
2
W2v Speech Emotion Recognition
MIT
A Wav2Vec2-fine-tuned English speech emotion recognition model capable of identifying six emotional states
Audio Classification English
W
Khoa
147
0
Arabic Speech Syllables Recognition Using Wav2vec2
This is a Wav2Vec2-based Arabic syllable recognition model capable of identifying syllables in Modern Standard Arabic from speech.
Speech Recognition
Transformers Arabic

A
IbrahimSalah
78
1
Wav2vec2 Ljspeech Gruut
Apache-2.0
A phoneme recognition model based on the Wav2Vec2 architecture, fine-tuned on the LJSpeech Phonemes dataset, used to convert speech into phoneme sequences
Speech Recognition
Transformers English

W
bookbot
2,484
17
Wav2vec English Speech Emotion Recognition
Apache-2.0
English speech emotion recognition model fine-tuned based on Wav2Vec 2.0, capable of recognizing 7 different emotions
Audio Classification
Transformers

W
r-f
139.06k
19
Malaya Speech Fine Tune Realcase 30 Jun Lm
This model is a fine-tuned version of malay-huggingface/wav2vec2-xls-r-300m-mixed on the uob_singlish dataset, mainly used for speech recognition tasks.
Speech Recognition
Transformers

M
RuiqianLi
71
0
Trained French
Apache-2.0
This is a French speech recognition model fine-tuned based on facebook/wav2vec2-base-960h, achieving a word error rate of 1.0 on the evaluation set.
Speech Recognition
Transformers

T
eugenetanjc
151
0
Model Facebookptbrlarge
Apache-2.0
A Brazilian Portuguese speech recognition model fine-tuned on the Common Voice dataset based on Facebook's wav2vec2-large-xlsr-53-portuguese model
Speech Recognition
Transformers

M
Vkt
22
0
Wav2vec2 Base Common Voice 50p Persian Colab
Apache-2.0
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base for Persian language, supporting Persian speech-to-text tasks.
Speech Recognition
Transformers

W
zoha
21
0
Wav2vec2 Xls R 300m Mr Cv9 With Lm
Apache-2.0
An automatic speech recognition model fine-tuned on Marathi speech datasets based on Facebook's XLS-R-300M model
Speech Recognition
Transformers Other

W
anuragshas
23
0
English Filipino Wav2vec2 L Xls R Test 09
Apache-2.0
English-Filipino speech recognition model fine-tuned from jonatasgrosman/wav2vec2-large-xlsr-53-english, achieving a WER of 0.5750 on the evaluation set
Speech Recognition
Transformers

E
Khalsuu
29.03k
1
English Filipino Wav2vec2 L Xls R Test 06
Apache-2.0
This model is a fine-tuned version of jonatasgrosman/wav2vec2-large-xlsr-53-english on the filipino_voice dataset, designed for English and Filipino speech recognition tasks.
Speech Recognition
Transformers

E
Khalsuu
24
0
Gram Vaani Harveen Chadda Fine Tuning
MIT
This is a speech recognition model fine-tuned based on Harveenchadha/vakyansh-wav2vec2-hindi-him-4200, supporting Hindi speech-to-text tasks.
Speech Recognition
Transformers

G
nnair25
30
0
Output
Apache-2.0
Automatic speech recognition model fine-tuned on Mozilla Common Voice Portuguese dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers Other

O
tonyalves
28
0
Wav2vec2 Large Xlsr 53 Coraa Brazilian Portuguese Gain Normalization
Apache-2.0
This is a Wav2vec 2.0 model fine-tuned for Portuguese, trained on multiple Portuguese speech datasets including CORAA, CETUC, MLS, etc.
Speech Recognition
Transformers Other

W
alefiury
28
0
Wav2vec2 Xlsr Multilingual 53 Fa
A multilingual speech recognition model based on the wav2vec 2.0 architecture, specifically fine-tuned for Persian, significantly reducing word error rate
Speech Recognition
Transformers

W
masoudmzb
83
7
Wav2vec2 Base Vietnamese
Apache-2.0
Vietnamese speech recognition model based on Wav2Vec2 architecture, fine-tuned on VSLP dataset, supports 16kHz sampled speech input
Speech Recognition
Transformers Other

W
dragonSwing
16
2
Wav2vec2 Large Xlsr Turkish
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Turkish Common Voice dataset based on the facebook/wav2vec2-large-xlsr-53 model, achieving a test WER of 21.13%.
Speech Recognition Other
W
cahya
61
2
Wav2vec2 Large Xlsr Rm Sursilv
Apache-2.0
This is an automatic speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, specifically designed for recognizing the Sursilvan dialect of Romansh.
Speech Recognition
W
gchhablani
27
0
Wav2vec2 Xls R 300m Lm Hebrew
Apache-2.0
Hebrew speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m with n-gram language model enhancement
Speech Recognition
Transformers Other

W
imvladikon
21
1
Wav2vec2 Large Xlsr Breton
Apache-2.0
A speech recognition model fine-tuned on the Breton Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Other
W
cahya
25
1
Wav2vec2 Large Xlsr 53 Telugu
Apache-2.0
A Telugu speech recognition model fine-tuned based on the facebook/wav2vec2-large-xlsr-53 model, trained using the OpenSLR SLR66 dataset
Speech Recognition Other
W
anuragshas
44.24k
5
Xls R Spanish Test
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the Spanish Common Voice 7 dataset, based on the facebook/wav2vec2-large-xlsr-53 model.
Speech Recognition
Transformers Spanish

X
pablouribe
29
0
Wav2vec2 Large Xlsr Greek 1
Apache-2.0
A speech recognition model fine-tuned on Greek language based on facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input.
Speech Recognition
Transformers Other

W
skylord
15
0
Custom German
Apache-2.0
German speech recognition model fine-tuned based on flozi00/wav2vec-xlsr-german
Speech Recognition
Transformers

C
chaitanya97
24
0
Xls Asr Vi 40h
Apache-2.0
This model is a speech recognition model fine-tuned on the Common Voice 7.0 Vietnamese dataset and private datasets based on facebook/wav2vec2-xls-r-300m.
Speech Recognition
Transformers Other

X
geninhu
14
0
Wav2vec2 Large XLSR 53 Assamese
Apache-2.0
Assamese automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained using the Common Voice dataset
Speech Recognition Other
W
infinitejoy
260
0
Wav2vec2 Large Xlsr Greek 2
Apache-2.0
A speech recognition model fine-tuned on the Greek Common Voice dataset based on facebook/wav2vec2-large-xlsr-53, balancing the training set with synthesized female voice data
Speech Recognition
Transformers Other

W
skylord
15
0
Output
Apache-2.0
This model is an automatic speech recognition (ASR) model fine-tuned on the Turkish COMMON_VOICE dataset based on cahya/wav2vec2-base-turkish-artificial-cv
Speech Recognition
Transformers Other

O
cahya
23
0
Bp500 Xlsr
Apache-2.0
This is a Wav2vec 2.0 model fine-tuned for Brazilian Portuguese, trained on multiple Brazilian Portuguese datasets, achieving a WER of 13.6 on the Common Voice test set.
Speech Recognition
Transformers Other

B
lgris
21
1
Wav2vec2 Xlsr Punjabi
Apache-2.0
An automatic speech recognition model fine-tuned for Punjabi using the Common Voice dataset, based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
W
gagan3012
2,433
0
Xls R 300m Sv
Apache-2.0
Automatic speech recognition model fine-tuned on Swedish dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers Other

X
hf-test
28
3
Bp Voxforge1 Xlsr
Apache-2.0
This is a Wav2Vec2 model fine-tuned for Brazilian Portuguese speech recognition tasks, trained on the VoxForge dataset.
Speech Recognition
Transformers Other

B
lgris
21
0
Wav2vec2 Large Voxrex Npsc
An automatic speech recognition model fine-tuned on the NBAILAB/NPSC - 16K_MP3 dataset based on KBLab/wav2vec2-large-voxrex
Speech Recognition
Transformers

W
NbAiLab
37
0
Xls R Hausa 40
Apache-2.0
Hausa automatic speech recognition model based on wav2vec2-xls-r-300m architecture, fine-tuned on Common Voice 8.0 Hausa dataset
Speech Recognition
Transformers Other

X
Mofe
22
1
Featured Recommended AI Models